Supported File Types for Content Inspection and OCR
The following table lists file types recognized by Cyberhaven's content inspection services and shows whether each format supports text extraction (content scanning) and optical character recognition (OCR).
| File Type | Content Scanning | OCR |
|---|---|---|
js | Yes | No |
jpg | No | Yes |
py | Yes | No |
png | No | Yes |
pyc | No | No |
ts | Yes | No |
pdf | Yes | Yes |
json | Yes | No |
c | Yes | No |
map | Yes | No |
h | Yes | No |
class | No | No |
java | Yes | No |
txt | Yes | No |
html | Yes | No |
pyi | Yes | No |
jpeg | No | Yes |
svg | No | Yes |
md | Yes | No |
heic | No | No |
strings | Yes | No |
mp4 | No | No |
csv | Yes | No |
tsx | Yes | No |
dat | Yes | No |
mov | No | No |
xml | Yes | No |
properties | Yes | No |
xlsx | Yes | No |
loopdata | No | No |
ithmb | No | No |
yaml | Yes | No |
css | Yes | No |
php | Yes | No |
mjs | Yes | No |
yml | Yes | No |
jar | Yes | No |
tsv | Yes | No |
kt | Yes | No |
zip | Yes | No |
docx | Yes | No |
plist | Yes | No |
rb | Yes | No |
ri | Yes | No |
gif | No | Yes |
log | Yes | No |
wav | No | No |
inc | Yes | No |
mp3 | No | No |
tif | No | Yes |
sql | Yes | No |